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M-CAT elements mediate both muscle-specific and non-muscle-specific transcription. We used artificial 
promoters to dissect M-CAT elements derived from the cardiac troponin T promoter, whose regulation is 
highly striated muscle specific. We show that muscle-specific M-CAT-dependent expression requires two 
distinct components: the core heptameric M-CAT motif (5'.CATTCCT-3'), which constitutes the canonical 
binding site for TEF-l-related proteins, and specific sequences immediately flanking the core motif that bind 
an additional factor(s). These factors are found in higher-order M-CAT DNA-protein complexes with TEF-1 
proteins. Non-muscle-specific promoters are produced when the sequences flanking the M-CAT motif are 
removed or modified to match those of non-muscle-specific promoters such as the simian virus 40 promoter. 
Moreover, a mutation of the 5'-flanking region of the cardiac troponin T M-CAT-l element upregulated 
expression in nonmuscle cells. That mutation also disrupts a potential E box that apparently does not bind 
myogenic basic helix-loop-helix proteins. We propose a model in which M-CAT motifs are potentially active in 
many cell types but are modulated through protein binding to specific flanking sequences. In nonmuscle cells, 
these flanking sequences bind a factor(s) that represses M-CAT-dependent activity. In muscle cells, on the 
other hand, the factor(s) binding to these flanking sequences contributes to both the cell specificity and the 
overall transcriptional strength of M-CAT-dependent promoters. 



M-CAT elements are transcriptional regulatory motifs with 
the canonical sequence 5'-CATTCCT-3' (8, 14, 37) that are 
required for the cell-specific transcription of the cardiac tro- 
ponin T (cTNT) gene in both embryonic skeletal and cardiac 
muscle (21, 36-38). Recently, M-CAT elements have been 
implicated in the cell-specific regulation of many cardiac pro- 
moters (15, 20, 27, 29, 33, 35, 40, 41, 45, 48, 55), Despite the 
clear role for M-CAT elements in striated muscle-specific tran- 
scription, several nonmuscle promoters have also been shown 
to be governed by M-CAT elements. The simian virus 40 
(SV40) enhancer is regulated in part by three binding motifs, 
dubbed GTIIC (5'-CATrCCA-3'), SphI (5'-CATGCTT-3'), 
and Sphll (5'-CATACTT-3') (11, 12, 60), that differ from the 
canonical M-CAT motif at nucleotide positions that were 
shown to result in functional M-CAT motifs in the context of 
the cTNT promoter (14). The SV40 enhancer and the closely 
related human papillomavirus type 16 E6 and E7 enhancer are 
active in a variety of nonmuscle cell lines such as HeLa cells, 
P19 and F9 teratocarcinoma cells, and keratinocytes (11, 16, 
22, 42). M-CAT-binding sites are active in two-cell mouse 
embryos and ES cell lines (39). In addition, the M-CAT-de- 
pendent somatomammotropin promoter directs cell-specific 
expression in placental cells (25). Until now, it was not known 
how M-CAT elements might control muscle-specific expres- 
sion on the one hand and nonspecific expression on the other. 

It is not uncommon for similar binding motifs to mediate 
different effects in different cell types. For example, E boxes 
(5'-CANNTG-3') govern muscle-specific transcription through 
binding of members of the myoD family of basic helbc-loop- 
helix (bHLH) transcription factors (collectively termed MDFs) 
that are themselves muscle specific (13, 43, 56). Likewise, in 
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Drosophila development, very early neuronal specification is 
achieved through the action of bHLH proteins of the achaete- 
scute complex that, at least at this stage of development, are 
found only in these proneuronal cells (reviewed in reference 
23). In B lymphocytes, however, E-box sites activate lymphoid 
cell-specific expression through the binding of ubiquitous 
bHLH factors such as the E2A gene products (2, 61). The 
specificity in this case may be generated by the combined 
effects of slightly increased E2A gene product concentration 
and a developmentally programmed decrease in the concen- 
tration of the negative-regulator HLH protein, Id (reviewed in 
references 3 and 26). Lymphoid cell-specific displacement of 
the ubiquitous repressor protein ZEB may also play a part 
(17). 

Similarly, octamer sites can direct either cell-specific or 
ubiquitous expression. The ubiquitous POU-domain protein 
Ocl-1 activates ubiquitous small nuclear RNA and hislone 
H2B gene transcription via octamer sites, Octamer-dependent 
lymphoid cell-specific immunoglobulin gene activation is likely 
to involve both Oct-1 and the closely related but lymphoid 
cell-specific Oct-2 protein, probably in collaboration with a 
lymphoid cell-specific coactivator, OCA-B (10, 34). However, 
the transcriptional activating properties of Oct-1 and Oct-2 are 
distinct, and Oct-2 is absolutely required for later stages of 
B-cell differentiation (9, 10, 30, 53, 54). Thus, cell-specific gene 
regulation in muscle, neuronal progenitors, and B lymphocytes 
can be mediated by common DNA-binding motifs and is 
achieved by the binding of different factors or combinations of 
factors and/or cofactors in the different tissues. 

M-CAT elements bind TEF-1 or TEF-l-related proteins 
both in muscle extracts and in nonmuscle extracts (14, 20, 22, 
27, 35, 40, 48, 49, 58), with the exception of the placenta- 
specific somatomammotropin M-CAT site (24). Recent exper- 
iments show that there are multiple TEF-l-rclated genes and 
that mRNAs from two of these TEF-l-related genes show 
distinct tissue distributions, one enriched in skeletal muscle 
and the other enriched in cardiac muscle (la, 51). Moreover, 
we have demonstrated that a muscle-specific complex is 
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formed on M-CAT elements. This complex contains a TEF-1 
protein that is muscle enriched but is also found in nonmuscle 
tissues (14a). Thus, it is probable that TEF-1 proteins will 
provide a measure of tissue specificity by virtue of their muscle 
enrichment and participation by as yet undefined interactions 
in muscle-specific complexes on the core M-CAT motif. 

Here, we report experiments that address the mechanism(s) 
controlling the tissue-specific activity of M-CAT elements. M- 
CAT elements were studied independently, in artificial pro- 
moters, to test their capacity to mediate tissue specificity in the 
absence of other promoter elements. We show that the canon- 
ical 7-nucleotide M-CAT core motif that constitutes the TEF- 
1-binding site is insufficient alone to mediate muscle specificity. 
However, the sequence flanking the cTNT promoter M-CAT 
core motifs strongly contributes to tissue specificity by repress- 
ing M-CAT-dcpendent promoter activity in nonmuscle tissues, 
coincident with the appearance of a second binding activity. 
The sequence-specific binding of nuclear factors to these flank- 
ing sequences implicates additional factors that, in collabora- 
tion with TEF-1 and TEF-l-related proteins, form a muscle- 
specific transcription complex. 

Note on nomenclature. For the purpose of this article, the 
heptameric M-CAT sequence itself (CATTCCT) will be re- 
ferred to as the core motif, while "M-CAT element" refers to 
this core motif surrounded by native flanking sequence as 
defined by footprint (38), or variants of this sequence. Further- 
more, the M-CAT-binding factors in muscle nuclear extracts 
are composed of the products of several different TEF-l-re- 
lated genes (la, 14a). In the interest of brevity, TEF-1 and 
TEF-l-related proteins, recognized by a polyclonal antibody 
raised against a common epitope, are hereafter collectively 
referred to as TEF-1 proteins. 

MATERIALS AND METHODS 

Tissue culture and CAT assay. All tissues were harvested from embryonic day 
12 chicks. Fibroblasts were isolated from skin that was peeled off the breast, 
flanks, and leg. Breast muscle and livers were dissected free of the connective 
tissue and gallbladder, respectively. After being minced, the tissues were washed 
in scrum-frcc medium and disrupted in 0.25% trypsin with periodic pipetting. 
When discrete pieces of tissue were no longer visible (3 to 10 min, depending on 
the tissue), the cells were pelleted, resuspendcd in medium, filtered through 
20-tJLm-porc-sizc nylon mesh, and counted. Muscle cells were plated at 1.25 x 10" 
cells per 60-mm-diametcr collagen-coated plate. Liver cells (consisting of a 
mixed population of fibroblasts and hepattK-ylcs) were plated at 1.5 x 10'' cells 
per 60-mm collagen-coaled plate. Skin-durived fibroblast cells were plated at 
0.75 X 10" cells per uncoaied plate. The cells were allowed to attach for 24 h and 
then iransfccted by standard calcium phosphate methods (19. 47). The cells were 
exposed to the transfcction cocktail for 16 h before the medium was changed. At 
48 h later, cells were harvested and lyscd by three freeze-thaw cycles. Chloram- 
phenicol acciyltransferasc (CAT) activity was assayed by standard techniques 
(47). 

Piasmid construction. All oligonucleotides were obtained from Operon Tech- 
nologies, Alameda, Calif., and designed with 5' cohesive termini to enable them 
to multimcrize only head to tail (see Table 1). Oligonucleotides were treated 
with kinase, mixed with various proportions of non-kinase-treated oligonucleo- 
tide (to limit the multimerization reaction), and ligated. After ligation overnight, 
overhanging ends were filled with Klenow polymera.se. Multimers in the appro- 
priate size range were isolated from a 6% nundenaturing polyaciylamidc gel, 
purified, and ligated into -49cTNT.CAT, into the Clal site in the polylinker just 
upstream of the cTNT TATA (38) (see Fig. lA, 2A, etc., for diagrammatic 
representations of the resultant multimers). All constructs were sequenced to 
verify the multimer copy number, sequence and orientation. All multimers pre- 
sented here contain either five or six element copies. Variation of the copy 
number from 4 to 6 did not affect relative efficiency (31a). However, it was noted 
that multimers in the sense orientation relative to the TATA box produced less 
efficient promoters in muscle cells than did those multimerizcd in the antiscnsc 
orientation, although orientation did not affect the activity of derived promoters 
in the nonmuscle tissues tested (31a). M-CAT-l/Emtcat is the only promoter 
presented in this study in which multimers are in the sense orientation, since they 
were never derived in the antisense orientation despite multiple attempts. 

Nuclear extract preparation, gel retardation assay, and Western analysis of 
retarded complexes. Nuclear extracts were prepared from skeletal muscle (breast 
and leg), heart, lung, and brain tissue from embryonic day 12 chickens as de- 



scribed previously (38). High-resolution gel retardation assays were performed as 
described previously (14a) with 0.3 ng of radiolabcllcd probe. 4 to 6 jxg of nuclear 
protein extract, 1 jig of poly(dl-dC), and a 250.fold excess of competitor DNA. 
where appropriate (unless otherwise noted in the figure legends). Anti-mouse 
myoD antibodies were purchased from Santa Cruz Biotechnology, Santa Cruz, 
Calif., and gel retardation supershifl analysis was performed by preincubating the 
binding- re act ion mixtures for 15 min at room temperature with 6 jig of affinity- 
purified antiserum prior to addition of radiolabelled probe. After addition of the 
probe, the binding reaction was continued as usual with a 20-min room temper- 
ature incubation. Western blot (immunoblot) analysis of DNA-protein com- 
plexes was performed as described previously (14a). 

Methylation interference assay. Mcthylation interference was performed with 
dimethyl sulfate (Fluka) essentially as described previously (I) with the following 
modification. Following a fivefold scale-up of the gel retardation incubation 
mixture, the entire gel was transferred to a DEAE membrane with a semidry 
blotter for 1 to 2 hours in 2.5 x Tris-borate-EDTA (TBE) and the membrane was 
exposed to film for 3 to 12 h. Following visualization and excision of the required 
bands, DNA was eluted by incubating the DEAE strips at 52X for 1 h in 
high-salt elution buffer (1 M NaCI, 50 mM Tris • HCl [pH 8], 10 mM EDTA), 
extracting the DNA with phenol-chloroform, and precipitating it with ethanol. 
The DNA was cleaved in 1 M piperidine (Fluka) and purified by multiple rounds 
of lyophilization as described previously (1). Cleaved ladders were resolved on a 
16% denaturing polyacrylamidc gel. 

RESULTS 

Multimerized M-CAT elements direct striated-muscle-spe- 
cific reporter gene expression. Striated-muscle-specific expres- 
sion of the cTNT promoter requires two M-CAT elements that 
are found within 99 bases upstream of the transcription initi- 
ation site (36, 37). The distal M-CAT element (M-CAT-1) and 
the proximal M-CAT element (M-CAT-2) are separated in the 
native promoter by a center-to-center spacing of 23 nucleo- 
tides, and perturbation of this spacing reduced promoter ac- 
tivity (37). Mutation of either M-CAT element abolishes pro- 
moter activity, indicating that a single M-CAT is insufficient to 
drive expression in the context of the native promoter. We 
created artificial promoters containing multiple copies of ei- 
ther M-CAT-1 or M-CAT-2 to study the contribution of each 
site, in isolation, to tissue specificity and efficiency of expres- 
sion (Fig. 1). In these artificial promoters, the flanking se- 
quences surrounding each M-CAT element included those 
previously shown to be protected during DNase I footprinting 
(38). The constructs also preserved the native 23-bp center-to- 
center spacing between M-CAT elements (38). The tissue 
specificity of these artificial promoters was assessed by trans- 
fection into primary cultures of skeletal muscle, skin fibro- 
blasts, and hepatocytes. 

In this multimeric form, the M-CAT-1 element directs very 
high-level striated-muscle-specific expression (Fig. IB). M- 
CAT-l-dependent reporter activity is 183-fold higher in mus- 
cle than liver cells and 284-fold higher in muscle than in skin 
cells. Multimerization of the M-CAT-2 element results in a 
much weaker promoter (note the scale in Fig. IC) but one that 
retains the ability to direct muscle-specific reporter activity 
(9.6-fold higher than in liver, undetectable in skin). The activity 
of M-CAT-1 or M-CAT-2 multimers is eliminated by mutation 
(Table 1) of the core M-CAT motif (data not shown). Thus, in 
multimeric form, both promoter elements carry inherent mus- 
cle specificity but show substantial quantitative diiferences in 
the levels of promoter activity they support. 

Tissue specificity is dependent on the configuration of the 
multimerized M-CATs. Earlier studies showed that the mul- 
timerized SV40 GTIIC element (CATTCCA) supported ex- 
pression in a range of cultured cell lines of nonmuscle origin 
(16). This multimerization differed from the M-CAT multi- 
mers described above not only in one base change at position 
7 in the core motif but also in the configuration of the mul- 
timers. The GTIIC elements were multimerized in tandem 
pairs separated by 2 bp (cenier-to-center spacing, 9 bp) and 
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FIG I cTNT promoicf M-CAT elements, when mullimcrizcd. can confer muscle specificity upon a basal promoter. (A) The sequences of the 23-hp oligonucle- 
otides awe ring hHonsensus M.CAT elements, M-CAT-l and M.CAT.2, and their flanking sequences derived from the cTNT promoter are f 
ci^oUdS^^^^^ and inserted, in the antisense orientation, directly upstream of the cTNTTATA box linked to the CAT reporter gene, to yield M-CAP-lcai 

S-cXt to^^^ M-CAT elements are shown here in the sense orienlaiion for lcgibilil>^ (B) M^AT-leat directs mysclcvspccfic reporter gene expression 

n prhna^ culture of s^^ muscle (M). embryonic skin fibroblasts (F), and liver cells (L). Reporter activity was nom,ali«:d to RSVcat activity tn each cdl type. 
" Weal is shown for comparison in the top panel. Error bars corresptmd to the standard error of the mean of s« or more indepcndcm iransfcct.ons. in all figures. 
(CrM-CAT-2cai expression is very much weaker than M-CAT-Icat expression (note the scale* in panels B and C) but is also muscle specilic. 



with each pair then separated by 29 bp (16). This tandem 
arrangement resembles the spacing of the diverged M-CAT 
variants Sphl and Sphll found in the SV40 enhancer. 

We created multimcrs with the core muscle M-CAT motif 
(CATrCCT) but configured as above, with a 9-bp center-to- 
centcr spacing (see Table 2), and inserted them into artificial 
promoters as described previously (M-CATx2cai (Fig. 2A1). 
This artificial promoter directs moderate to low-level expres- 



sion in all cell types tested (Fig. 2B, top panel). The number of 
copies, from six to eight, did not significantly affect the gross 
activity or tissue specificity of the resultant chimeric promoter 
(data not shown). Next, we investigated the contribution of 
correct flanking sequence to the muscle specificity of muUim- 
erized M-CATs. We multimerized a consensus M-CAT motif 
(CATTCCT) to give 23-bp center- to-center spacing but with 
the flanking sequence derived from the sequence surrounding 
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to RSVcat, CAT activity in muscle cells was twice that in liwr cells and four times that in skin tibrohliists. 
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the GTIIC element in the SV40 early enhancer. As before, this 
multimcr was cloned upstream of the cTNT TATA (M-CAT/ 
SV40cat (Fig, 2A]). Figure 2B (bottom panel) shows that this 
chimeric promoter directs eificient reporter gene expression in 
all three cell types. Thus, muscle-specific expression is depen- 
dent upon the presence and sequence of the DNA flanking the 
core M-CAT motifs. 

The 5'-flanking region of the M-CAT-1 element mediates 
tissue-specific repression. The 5'-flanking region of the M- 
CAT-1 element contains a potential E box (38), which is pre- 
dicted to be a low-affinity MDF-binding site by comparison 
with the consensus derived from authentic MDF-dependent 
promoter comparison and various PCR techniques (5, 57, 59). 
Deletion of this E box decreased promoter activity modestly 
but did not affect tissue specificity (38). To examine the role of 
this E box in M-CAT-1 multimers, an M-CAT-1 element with 
an E-box mutation was used to construct an artificial promoter 
as described above ( M-CAT- 1/Emtcat [Fig. 2C]). Mutation of 
three bases within the E box in the M-CAT-1 oligomer greatly 
increased expression in skin fibroblasts and liver (Fig. 2D), 
Thus, mutation of the E box or some overlapping site appears 
to relieve a repression mechanism that blocks expression in 
nonmuscle ceils. Since MyoD family members are not found in 
these nonmuscle tissues, the factor(s) responsible for this re- 
pression must be either a non-MDF E-box-binding factor or a 
factor which recognizes an overlapping site. 

Multimerization of M-CAT-l/Emt produced a promoter 
substantially weaker in muscle (Fig. 2D) than its wild-type 
equivalent (M-CAT-lcat; Fig. IB, middle panel). However, we 
believe that the relatively low activity of the M-CAT-l/Emt 
artificial promoter in muscle reflects at least in part the sense 
orientation of the multimers in this construct (see Materials 
and Methods). Thus, the mutation of three bases in the region 
of the E box relieves repression in nonmuscle tissues and may 
affect activity in muscle cells. 

Gel retardation analysis of the M-CAT elements. Gel retar- 
dation analysis was performed to examine the binding of pro- 
teins to the flanking regions of each M-CAT element. The com- 
plexes resulting from the incubation of the 23-bp M-CAT-1 
element with embryonic skeletal muscle extract are shown in 
Fig. 3A. The three high-mobility complexes labeled CI, C2, 
and C3 are described in detail elsewhere (14a) and are gener- 
ated by the interactions of three distinct TEF-1 polypeptides 
with the core M-CAT motif (Fig. 3A, lane 1). These complexes 
are competed for by a 250-fold excess of any unlabeled oligo- 
nucleotide containing an intact M-CAT core motif, although 
the affinities vary (Fig. 3A, lanes 2, 4, 6, 7, 9, and 10). M-CAT 
mutant DNA and MCK E-box DNA fail to compete for these 
major complexes (Fig. 3A, lanes 3, 5, and 8) indicating se- 
quence-specific DNA-protein binding. 

In addition to these major DNA-protein complexes, two 
lower-mobility complexes (LMC lA and LMC IB) are gener- 
ated with the M-CAT-1 probe (Fig. 3A, lane 1). These com- 
plexes are self-competed for (Fig. 3A, lane 2). LMC IB (and to 
a lesser extent, LMC lA) is also competed for by the 
M-CAT- Im element, in which the core M-CAT motif is mu- 
tated, leaving only the flanking sequence intact (lane 3). How- 
ever, neither LMC lA nor LMC IB can be competed for by 
M-CAT-2 elements whether the core motif is intact or mutated 
(lanes 4 and 5). Similarly, the oligonucleotide containing the 
core M-CAT motif surrounded by SV40 enhancer flanking 
sequence is unable to compete for either LMC lA and LMC 
IB (lane 6). A chimeric M-CAT element, containing the 5'- 
flanking sequence of the M-CAT-1 element and the 3'-flanking 
sequence of the M-CAT-2 element, competes for both LMC 
I A and LMC IB (M-CAT- 1/2; lane 9), but the reverse chimera 



cannot (M-CAT-2/1; lane 10). We conclude that it is the 5'- 
flanking sequence of M-CAT-1 that is most important in the 
formation of LMC lA and LMC IB. 

Gel retardation analysis of the M-CAT-2 probe in the pres- 
ence of embryonic skeletal muscle nuclear extract reveals a 
very similar situation (Fig. 4). There are, as before, three major 
retarded complexes of relatively high mobility that are gener- 
ated by TEF-1 polypeptides bound to the core M-CAT motif 
(CI, C2, and C3 in Fig. 4). These appear to be identical in 
mobility to those formed on the M-CAT-1 probe, and, indeed, 
these probes cross-compete for CI, C2, and C3 (Fig. 3 A, lane 
4; Fig. 4: lane 2). As judged by such competition experiments, 
the M-CAT-1 element constitutes a higher-affinity binding site 
than does the M-CAT-2 element. These major complexes are 
also competed for by a 250-fold excess of any competitor DNA 
containing an intact M-CAT core motif (Fig. 4, lanes 2, 4, 6, 7, 
9, and 10). Competitor DNA containing a mutation within the 
core M-CAT motif fails to compete for any of CI, C2, or C3 
(lanes 3 and 5), as does an unrelated DNA sequence (the MCK 
E-box [lane 8]). 

M-CAT-2 interaction with embryonic skeletal muscle nu- 
clear extract also formed sequence-specific low-mobility com- 
plexes (Fig. 4, LMC 2A and LMC 2B). LMC 2A is a doublet 
whose component bands apparently have identical sequence 
specificity and tissue distribution. LMC 2A can be competed 
for only by wild-type M-CAT-2 oligonucleotide with its intact 
flanking sequences (Fig. 4, lane 4). However, because LMC 2A 
is formed only when the non-cTNT-derived cohesive termini 
(Table 1) of the M-CAT-2 23-bp oligonucleotide are filled with 
Klenow polymerase (31a), the significance of LMC 2A is un- 
certain and we; do not consider it further here. 

LMC 2B, by contrast, appears to be an authentic equivalent 
of the low-mobility complexes formed with the M-CAT-1 ele- 
ment. LMC 2B is competed for by competitor DNAs that 
include the M-CAT-2-flanking sequence, irrespective of 
whether the M-CAT motif itself is intact or mutated (Fig. 4, 
lanes 4 and 5). LMC 2B is not competed for by M-CAT-1 
elements, whether their core motif is intact or mutated (Fig. 4, 
lanes 2 and 3), indicating that different proteins probably in- 
teract with the flanking regions of the M-CAT-1 and M-CAT-2 
elements. As is the case for LMC lA and LMC IB, an oligo- 
nucleotide containing the core M-CAT motif surrounded by 
SV40 enhancer flanking sequence is unable to compete for 
LMC 2B (lane 6). A chimeric M-CAT element containing the 
M-CAT-2 5'-fianking region and the M-CAT-1 3'-flanking re- 
gion can compete (M-CAT-2/1 (lane 10]), but the reverse chi- 
mera cannot (M-CAT-1/2 [lane 9]). Thus, LMC 2B depends 
upon the 5 '-flanking sequence of M-CAT-2 for its formation. 

The M-CAT variants M-CAT/SV40 and M-CAT-l/Emt 
were radiolabelled and subjected to gel retardation analysis. 
Neither formed detectable low-mobility complexes (data not 
shown). This result, together with the competition experiments 
(above), indicates that the flanking regions surrounding M- 
CAT core motifs are important for the formation of low- 
mobility protein-DNA complexes. We tentatively conclude 
that the flanking sequences of each M-CAT motif of the cTNT 
promoter interact with a distinct protein(s) that is likely to be 
involved in the muscle-specific activity of these elements. 

Contribution of the E box to the formation of LMC lA and 
IB. Since a mutation within the nominal E box had a profound 
effect on the tissue specificity of the M-CAT-1 element, we 
tried to compete for LMC lA and LMC IB with excess oligo- 
nucleotide containing the E-box mutation and found that it 
was unable to compete well for either complex (Fig. 3A, lane 
7). We next investigated the competition ability of the well- 
characterized, muscle-specific MCK E box (defined as a high- 
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FIG. 3. Protein complexes bind in a sequence-specific manner lo (he S'-flanking sequence of the M-CAT- 1 clement, (A) The 23-bp M-CAT- 1 element was 
radioliibcllccl and sul>jcctcd to gel retardation analysis in the presence of skeletal muscle nuclear extract. Prutein-DNA complexes were challenged with a 250-fold excess 
of unlabcltcd competitor oligonucleotides os labeled above the lanes. TTic TEF-l/M-CAT complexes, CI, C2, and C3, arc labelled on the right, as ore LMC I A and 
LMC IB. Intermediate-mobility, lower-intensity complexes that appear to show simitar sequence specificity (o LMC IB upon competition were also formed. Their 
significance is not clear. (B) Comparative gel retardation analysis of the M-CAT-1 element and the MCK E box in the presence of skeletal muscle nuclear extract. Gel 
retardation analysis of M-CAT-1 is shown in lanes 1 to 4. Analysis of the MCK E-box is shown in lanes 5 lo 9. Labelled probes were adjusted to equal specific activity, 
and lanes shown are taken from a single exposure of the same experiment to demonstrate comparative avidities of complex formation. Protcin-DNA complexes were 
challenged with unlalK-lled competitor oligonucleotides: lanes 1. 4, 5, and 9, none; lanes 2 and 6. 250.fold molar excess of MCK E-box; lanes 3 and 7, 250-fold molar 
excess of M-CAT-1 element; lane 8. 750-fold molar excess of M-CAT-1 element. Chicken skeletal muscle nuclear extracts in lanes 4 and 9 were preincubaied with 6 
|ig of anti-mouse myoD antiserum. TEF-l/M-CAT complexes, CI, C2, and C3. are labelled to the left of the figure, as arc LMC lA and LMC IB. Tlie presumptive 
E-box-MDP ctmiplex is labelled with an open arrow to the right of the figure, and the tertiary chicken myoD/amyoD antibody supershift is labelled with an open 
arrowhead to the right of the Hgure. 



affinity myoD:E12-binding site [5, 6, 59)). LMG lA was com- 
peted for, whereas LMC ] B was unaffected (Fig, 3A, lane 8, 
and Fig. 3B, lane 2). Thus, LMC lA and LMC IB rcsuh from 
the binding of proteins with distinct sequence specificities. 

Since LMC 1 A was competed for by a canonical MDF E box. 
we investigated whether it might contain an MDF component. 



DNA-protein complexes formed between the MCK E box and 
muscle nuclear extracts have similar mobility to LMC 1 A (Fig, 
3B, compare lanes 1 and 6). If these complexes are formed by 
interaction with the same protcin(s), the affinity of the M- 
CAT-1 E box is clearly lower than that of the MCK E box, as 
judged by the relative intensity of the relevant bands (Fig. 3B, 
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FIG. A, Protein complexes bind in ;i sequence-specific manner to the 5'- 
llanking sequence of the M-CA'I'«2 element. 'Vhc 23-bp M-CAT-2 element (se- 
quence shown in i*ig. lA and I able I) was radiolahelled and subjected to gel 
retardation analysis in the presence of skeletal muscle nuclear extract- Protein- 
DNA complexes were challenged with a 250-fold excess of unlalicllcd competitor 
oligunuclcuitdcs as indicated above the lanc$. The high>mobility TEF*I/M*CAT 
complexes. Cl, C2 »nd O. arc labelled to the right of the ligurc. as arc LMC 2A 
and LMC 2B. 



compare lanes 1 and 5). However, whereas the MCK E-box- 
chicken muscle extract complexes were supershifted in part 
with anti-murinc niyoD antibody (lane 9), LMC lA was unaf- 
fected (lane 4), suggesting that chicken myoD may not partic- 
ipate in LMC 1 A formation. Moreover, LMC lA is formed in 
several nonnniscle tissues (Fig. 5, lane 5, and data not shown). 
Tlius, while LMC 1 A shares the DNA sequence-binding spec* 
ificity of bHLH proteins, it is unlikely to belong to the muscle- 
specific myoD family of bHLH proteins. 

LMC 1 A is a low-intensity complex compared with LMC 1B» 
and hence its functional contribution may be similarly small. 
Moreover, LMC lA is not found in several nonmuscle tissues 
tested (Fig. 5 and data not shown), inconsistent with a key role 
for this activity in repression of M-CAT elements in such 
tissues. We tentatively conclude that LMC lA is not the activ- 
ity responsible for the repression in nonmuscle cells. 

LMC IB, by contrast, is a reasonable candidate for the 
factor(s) that interacts with the 5'-flanking sequence of the 
M-CA T-l element to mediate the repression of its activity in 



TABLE I . Oligonucleotides used in the construction of M-CAT 
muliimcr-hused synthetic promoters and as 
compel ilor oligonucleotides 



Name 


Sense strand sequence** 


Length 
(ni) 


M-CAT- 1 


•PGCA.-.OTG'rTGCATTCCTCTCTG 


23 


M-CAT- Imt 


TGCA.nGTGTTG£cccaSICrCTG 


23 


M-CAr-2 


TGCGCCGGGCACATTCCTGCTGC 


23 


M-CAT-2mt 


TGCGCCGGGCACcccaCTGCTGC 


B 


M-CATX2 


CCGAGAGACGCA'ITCCTCGCATTCCTCTGC 


30 


M-CAT/SV40 


TGCCTGACACACATrCCTCAGCT 


23 


M-CAT- lEmi 


TGCa.:;CccTTGCATTCCTCTCTG 


23 


M-CAT- 1/2 


TGCAAGTGTTGCATTCCTGCTGC 


23 


M-CAT-2/l 


TGCGCCGGGC AC ATTCCTCTC zTG 


23 


MCK E box 


GATCCGGCTCAGGCAGCAGGTGTTGG 


26 



*' Cohesive terniiiii are denoietl by }H)ldruce type, ihe core M-CAT motir(s) is 
imlieaicil by underlining, and nniiaicd bases are dem>led by lowercase type. The 
MCK \l \w\ (Mi£F-l site) sequence is derived from the right'hand site of the 
iiiiisele creatine kin:ise (MCK) enhancer (6). 
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PROBE: M-CAT-2 M-CAT-I 

Tissue: ^ ^^"^ ^ ^ 




1 2 3 4 5 6 

RG. 5. Tissue distribution of the nuclear factors binding the M-CAT- 1 and 
M-CAT-2 elements. Gel retardation analysis of DNA-proicin complex formation 
on the M.CAT-2 probe (lanes 1 to 3) and on the M-CAT- 1 probe (lanes 4 to 6) 
in the presence of nuclear extracts from skeletal muscle (Skel. M.). brain, and 
lung tissue, T\\c high-mobility TEF-I/M-CA T complexes. CI, C:2, and C3. arc 
labelled to the right of the figure, as are LMC I A and LMC IB. formed on the 
M-CAT- 1 probe, LMC 2A and LMC 2B, formed on the M-CAT-2 prol>c, arc 
labelled lo the left of the figure. 



nonmuscle tissues. LMC 1 B can be detected in nuclear extracts 
from all nonmuscle tissues tested (Fig. 5 and data not shown). 
LMC IB cannot be competed for by a consensus E box (Fig. 
3A, lane 8; Fig. 3B, lane 2), nor is it recognized by the mouse 
anti-myoD antiserum (Fig. 38, lane 4); therefore, we conclude 
that this complex is generated by a non-MDF factor with se- 
quence specificity of binding distinct from that of MDFs as well 
as of LMC lA. Interestingly, LMC IB derived from muscle 
appears to consistently migrate fractionally faster than LMC 
IB derived from nonmuscle tissue (Fig. 5, compare lane 6 with 
lanes 4 and S). 

Methylation interference footprints of low-mobility com* 
plexes. We used methylation interference footprtnting to com- 
pare the DNA-protein interactions in low-mobility complexes 
with those in the high-mobility complexes, Cl, C2 and C3, that 
contain TEF-i proteins bound to the core M-CAT motif (14a). 
Methylation of G^'* and G^' (antisense strand), at the center of 
the core M-CAT motif, abolished formation of Cl, C2, and C3 
on both M-CAT- 1 and M-CAT-2 elements (Fig. 6 A, compare 
lanes 5 to 7 with free M-CAT-1 probe in lane 8; Fig. 6B, 
compare lanes 13 to 15 with free M-CAT-2 probe in lane 16). 
In the case of M-CAT- 1, methylation of residue G''^ (sense 
strand) immediately 5' of the M-CAT motif is also incompat- 
ible with formation of complexes Cl, C2, and C3 (Fig. 6A, 
compare lanes 1 to 3 with free probe in lane 4). Thus, inter- 
action of a TEF-1 protein(s) with the M-CAT clement is sen- 
sitive to G methylation over a narrow area centered on the 
core of the M-CAT motif. 

Analysis of the low-mobility complexes indicates a broader, 
more complex footprint (Fig. 7). LMC lA formation is abol- 
ished by methylation at residue G*" and reduced by methyl- 
ation at residues G"* and G'^ on the sense strand (Fig. 7A, 
compare lane 3 with free probe in lanes 1 and 4). LMC 1 A is 
abolished by methylation at residues G'' and G on the anti- 
sense strand and slightly reduced by methylation at residues 
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M-CAT-I 



B 



sense 

01 C2 C3 F 



antlsense 

C1 02 03 F 



M-CAT-2 

sense antisense 

01 02 03 F 01 02 03 F 



26 
23 



*- -«* 16 



m^mm^m 16 



12 



10 



W 23 
* 25 



12 
U 



20 
21 



24 




■•^^■^•pw- iiir 27 



29 



30 



12 3 4 



5 6 7 8 



M PAT 1 TCGGCACAAGTGTTGCATTCCTCTCTGGGTCGG 
J^^'zSi 2^ 5 10 15 20 25 30 

C1, C2, C3 AGCCGTGTTCAC AACGTAAGGAGAGACCCAGCC 



TCGGGGCGCCGGGCACATTCCTGCTGCTCTCGG 
5 10 iJ 20 23 30 

C1, C2, C3 AGCCCCGCGGCCCGTGTAAGGACGACGAGAGCC 



M-CAT-2 



G^' and G*'' (Fig. 7A, compare lane 7 with free probe in lanes 
5 and 8). Thus, the LMC 1 A meihylation interference pattern 
is consistent with a protein(s) binding to the E-box region (Fig. 

7C). 

The LMC IB pattern of sensitivity to G meihylation is dis- 
tinct from that seen with either CI, C2, and C3 or LMC lA. 
LMC IB formation is abolished by methylation of G*"* and G'^ 
and reduced by methylation at G'" or G^ on the sense strand 
(Fig. 7A, compare lane 2 with free probe in lanes 1 and 4). 
Methylation of G'^' on the antisense strand abolished LMC IB 
formation, and methylation of G^'^ reduced its formation (Fig, 
7A, compare lane 6 with free probe in lanes 5 and 8). Thus, the 
LMC IB footprint encompasses the 5' end of the M-CAT 
motif as well as the flanking sequence immediately upstream, 
including part of the E-box consensus (Fig. 7C). 




13 14 15 16 



FIG. 6, Methylation interference analysis of the TEF-I/M-CAT complexes, 
CI, C2, and C3. (A) A 33-mcr M-CAT-1 probe was radiolal>clicd on the sease or 
antisense strand, and muscle nuclear protein-DNA interactions within ihc TEF- 
1/M-CAT complexes, CI, 02 and C3, were examined hy methylation interference 
analysis. Cleavage patterns of CI-, C2% and C3-bound probe are .shtwn m lanes 
I to 3 (scasc .strand) and lanes 5 to 7 (antisense strand), respectively. The 
cleavage patterns for unbound probe arc shown in lanes 4 (sense strand) and 8 
(antisense strand). A summary of the mcthylalcd G residues which abolish com- 
plex formation is shown to the right of each panel (solid circles). Residue num- 
bering is from the 5' end of the duplex 33-mcr probes, as shown in panel C. (B) 
A 33-mer M-CAT-2 probe was radiolabcllcd on the sense or antisense strand, 
and muscle nuclear protein-DNA interactions within the TEF-I/M-CAT com- 
plexes, CI, C2, and C3, were examined by methylation interference analysis. Cleav- 
age patterns of C1-, C2- and C3-bound probe arc shown in lanes *) to II (sense 
strand) and lanes 13 to 15 (antisense strand), respectively. The cleavage pattern 
for unbound probe is shown in lanes 12 (sense strand) and W> (antisense strand). 
A summary of the methylated G residues which abolish complex formation is 
shown 10 the right of each panel (solid circles). (C) Summary of the mcthyhtlion 
interference analysis of TEF-I/M-CAT complexes. Cl. C2, and C3. formed on the 
M-CAT- 1 and M-CAT-2 elements of the cTNT promoter. G residues whose 
meihylation abolished complex formation arc dcnoied by solid circles. Number- 
ing is from the 5' end of the double-stranded oligonuclet»tides. The M-CA'I'-l 
oligonucleotide spans hp -109 to -83 of the cTNT pronuilcr. and the M-CAT^2 
oligonucleotide spans bp -85 to -5^. 



Formation of LMC 2B is incompatible with methylation of 
G*^ and is reduced by methylation at G\ G'\ G^ G' \ or G'' 
on the .sense strand (Fig. 7B, c(>mp:jrc iunc 10 with free probe 
in lane 9). LMC 2B formation is abolished by methylation of 
G-" and reduced by methylation of G"*^, G' , or G"* on the 
antisense strand (Fig. 78, compare lane 12 with free probe in 
lane 1 1). Thus, the LMC 2B methylation interference pattern 
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LMC1A 



LMC1B 



LMC 2B 



o mo 

TCGGCACAAGTGTTGCATTCCTCTCTGGGTCGG 
S 10 15 20 25 iO 

AGCCGTGTTCACAACGTAAGGAGAGACCCAGCC 
• • O O 



TCGGCACAAGTGTTGCATTCCTCTCTGGGTCGG 

5 10 19 30 29 )0 

AGCCGTGTTCACAACGTAAGGAGAGACCCAGCC 
• O 



oo o •O o 

TCGGGGCGCCGGGCACATTCCTGCTGCTCTCGG 
5 10 15 20 25 30 

AGCCCCGCGGCCCGTGTAAGGACGACGAGAGCC 
O O • O 



FIG, 7. Mclhylation interference analysis of LMC lA, LMC IB, and LMC 
2B. (A) A 33-mcr M-CAT*! probe was radiolabelled on ihc sense or antisense 
strand, and muscle nuclear protcin-DNA inicractions within LMC lA and LMC 
IB were examined by mclhylation interference analysis. Cleavage patterns of 
LMC IB-bound probe are shown in lanes 2 (sense strand) and 6 (antisense 
strand); cleavage patterns of LMC 1 A4x>und probe arc shown in lanes 3 (sense 
strand) and 7 (antiscase stnmd). The cleavage pattern for unbound probe is shown 
in lanes 1 and 4 (scase strand) and lanes 5 and 8 (ajuisense strand). A summary of 
the methylated G residues which interfere completely (solid circles) and partially 
(open circles) with LMC IB formation is shown to the right of each panel, and 
a simitar summary for LMC lA is shown to the far right of each panel. The 
complete and partial scores result from comparison of three independent expcr- 
imcnus for LMC IB and two experiments for LMC lA. (B) A 33-mer M-CA'r.2 
probe ms radiolabcllcd on the sense or antisense strand^ and muscle nuclear 
protein-DNA interactions within LMC 2B were examined by methylaiion inter- 
ference anal>'sis. Cleavage patterns of LMC 2B-bound probe are shown In lanes 
10 (sense strand) and 12 (antisense strand). The cleavage pattern for unbound 
probe is shown in lanes 9 (sense strand) and 1 1 (antisense strand). A summar)' 
of the methylated G residues which interfere a)mpletcly (solid circles) and 
partially (open circles) with LMC 2B formation is shown to the right of each 
panel. Complete and partial scores result from comparison of three independent 
experiments. (C) Summary of the mcthylation interference anal>'sis of LMC lA, 
LMC IB. and LMC 2B formed on the M>CAT*1 and M-CAT-2 elements of the 
cTNT promoter. C residues whose mcthylation abolished or partially interfered 
with complex formation arc denoted by solid and open circles, respectively. 
Numbering is from the 5' end of the double-stranded oligonucleotides. 
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Skeieial Muscle 

Nuclear Extract LMC 1 B ^ LMC 2B 

m^tt "3^1 '^^^i- _57kD 

mmmpppr * 52ko 

1 2 3 

FIG. Wcsicrn bh>l analysis of proteins clulcd from LMC I B and LMC 2B. 
Lanes: I , S ^l,g iif unfractionatcd skeletal muscle nuclear proteins run on the same 
gel for comparist)n; 2, polypeptides cluted from the LMC IB protein/M-CAT-1 
complex; X polypeptides clulcd from the LMC 2B prolcin/M-CAT-2 complex. 
Detection of TEF-I in these complexes is specific* since un intcrmcdlatcmobtlity 
complex formed in brain nuclear extract was tested and contained no TEF-l 
proteins hy this assay. The estimated sizes in kilodaltons of the TEF-l polypep- 
tide hands are indicated to the right of the panel. 



encompasses the entire M-CAT motif as well as the proximal 
part of the 5'-flanking sequence (Fig. 7C). The experiments 
shown were performed with skeletal muscle extract; however, 
essentially identical results were obtained when a brain nuclear 
extract was used (data not shown). 

The LMC IB and LMC 2B footprints partially overlap the 
M-CAT core motif, although the sensitivity to methylation of 
antisense residue G^* (i.e., the 3' end of the motif) is less pro- 
nounced in the low-mobility complexes than in the high-mo- 
bility complexes, CI, C2, and C3. This could indicate that the 
protein(s) in LMC IB and 2B displaces the TEF-l proteins 
bound to the M-CAT core motif by virtue of an overlapping 
recognition site or, alternatively, that TEF-l proteins partici- 
pate in the formation of the low-mobility complexes with a 
coincident alteration in their interaction with the core M-CAT 
motif. This led us to ask if the TEF-l proteins present in the 
major high-mobility complexes CI, C2, and C3 arc also present 
in LMC IB and LMC 2B. 

Higher-order complexes contain TEF-l polypeptides. Im- 
munoblotting with a polyclonal antibody broadly reactive 
against TEF-l proteins revealed three TEF-l -related polypep- 
tides in extracts of striated muscle (Fig. 8, lane 1). These three 
polypeptides are also isolated from gel shift high-mobility com- 
plexes CI. C2, and C3 (14a). To determine if TEF-l proteins 
are present in the low-mobility complexes, LMC IB and LMC 
2B were gel isolated and subjected to sodium dodecyl sulfate- 
polyacrylamide gel electrophoresis and immunoblotting with 
the anti-TEF-1 antibody. LMC lA was insufificiently abundant 
to be analyzed in this way. However, both LMC IB and LMC 
2B contained the same three TEF-l polypeptides that are 
found in unfractionatcd muscle nuclear extracts (Fig. 8, com- 
pare lanes 2 and 3 with lane 1). We conclude from this exper- 
iment that TEF-l proteins are components of the low-mobility 
complexes and therefore that TEF-l proteins interact with 
other proteins to form higher-order complexes on the 
M-CAT- 1 and M-CAT-2 elements. 

DISCUSSION 

The present work shows that muscle-specific M-CAT ele- 
ments are complex protein-binding sites that depend upon 
both a core M-CAT motif (CATTCCT) and the flanking se- 
quences surrounding that motif. The core motif is a binding 
site for TEF-l proteins, and this binding interaction is essential 
for the function of M-CAT elements. Flanking sequences mod- 
ulate expression directed by M-CAT motifs. Non-muscle-spe- 
cific artificial promoters were obtained when flanking se- 
quences from the SV40 enhancer were used to surround a 
canonical core M-CAT motif or when intervening flanking 
sequence was removed (Fig. 2B). This explains why M-CAT 



elements from the SV40 genome are active in HeLa, PI 9, and 
F9 cells and in keratinocytes (11, 16, 22, 42, 60), as well as the 
nonmuscle cell types used here. Strong non-muscle-spccific 
promoters were produced when the spacing, centcr-to-center, 
between M-CATs was 23 bp or greater (Fig. 2B, bottom panel, 
and data not shown). The promoter produced was weaker 
when the M-CAT elements were closely apposed (Fig. 2B, top 
panel). 

Muscle-specific artificial promoters were obtained, on the 
other hand, when flanking sequences from the cTNT promoter 
were used to surround canonical core M-CAT motifs (Fig. IB 
and C). Flanking sequence appears to be responsible for me- 
diating repression of the cTNT-derivcd M-CAT elements in 
nonmuscle cells. For example, the M-CAT-l/Emt variant con- 
tains a mutation within the E-box site found in the 5 '-flanking 
sequence of the cTNT M-CAT- 1 element. The promoter de- 
rived by multimerization of this element was efficiently ex- 
pressed in liver and fibroblast cells (Fig. 2D), unlike the native 
cTNT promoter or artificial promoters containing native flank- 
ing sequence, which were inactive in these cell types. 

M-CAT element-dependent promoters are also absolutely 
dependent upon an intact core M-CAT motif for activity, in- 
dicating that protein interaction with the core M-CAT motif is 
a minimum requisite for function of the compound element 
(16, 37, 38; also see above). M-CAT core motifs are binding 
targets for TEF-l proteins (14, 20, 22, 27, 35, 49, 51, 58). The 
dependence of tissue specificity upon appropriate flanking se- 
quences might result from the binding of additional proteins, 
which modulate the activity of TEF-l proteins bound to the 
M-CAT core motif. Alternatively, the modulation might be 
mediated by a change in interaction between bound TEF-l 
proteins and the different flanking sequences. 

To distinguish between the above possibilities, we looked for 
specific interactions between flanking sequences and proteins 
present in muscle and nonmuscle extracts. We found that any 
M-CAT element variant that contained an intact core M-CAT 
motif, regardless of flanking sequence, could compete for the 
three high-mobility complexes, CI, C2, and C3, although the 
tissue-specific activity of the elements varied widely (Fig. 3 and 

4) . Therefore, we conclude that this binding to the core motif 
alone is probably insufficient to confer cell-specific regulation. 
However, gel retardation assays revealed that higher-order 
complexes also form on M-CAT- 1 and M-CAT-2 elements. 
The formation of higher-order complexes depends on specific 
cTNT flanking DNA sequences because M-CAT elements with 
non-cTNT-derived flanking sequence neither form such high- 
er-order complexes (data not shown) nor compete for their 
formation (Fig. 3 and 4). Moreover, the ability of specific 
flanking sequences to form and compete for higher-order com- 
plex formation (Fig. 3 and 4 and data not shown) directly 
correlates with their ability to support muscle-specific tran- 
scription in artificial promoters (Fig. 1 and 2). 

Different higher-order complexes form on M-CAT- 1 and 
M-CAT-2 elements, as judged by their distinct mobilities (Fig. 

5) and the inability of the probes to cross-compete (Fig. 3 and 
4). Competition experiments with chimeric M-CAT elements 
indicate that low-mobility complex formation depends on the 
5' sequence of cTNT M-CAT elements (Fig. 3 and 4). More- 
over, competition experiments with the M-CAT-l/Emt variant 
indicate that a localized 3-nucleotide mutation in the 5'-flank- 
ing sequence of M-CAT- 1 disrupts binding of LMC I A and 
LMC IB (Fig. 3). These data arc consistent with the notion 
that additional proteins bind in a sequence-specific manner to 
the flanking regions of cTNT-derived M-CAT elements to 
form LMC lA, LMC IB, and LMC 28. 

LMC 1 A and LMC IB both appear to result from the bind- 
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ing of a protein at, or very near, the E box, because a 3-nucle- 
otide mutation in this site in M-CAT-l/Emt greatly reduces the 
ability of that clement to compete for either complex (Fig. 3A, 
lane 7). LMC 1 A formation appears to result from binding of 
a factor to the E box itself, since the mcthylation interference 
footprint derived for LMC 1 A is strongest over the E box (Fig. 
7A and C) and the MCK E box can compete for this complex 
(Fig. 3). However, the limited tissue distribution and low rel- 
ative intensity of LMC 1 A lead us to tentatively conclude that 
its functional significance is likely to be minor. 

LMC IB has a binding specificity distinct from LMC lA and 
MDFs, since it is not competed for by the MCK E box (Fig. 3). 
This complex is formed in all tissues examined (liver, kidney, 
gizzard, lung, brain, and skcletial and cardiac muscle [Fig. 5 and 
data not shown]). LMC IB is therefore also unlikely to contain 
MDF family members. Its methylation interference footprint 
encompasses the 5' end of the core M-CAT motif and the 3' 
end of the E box (Fig. 7A and C). LMC IB competition 
experiments with mutations through the 5'-flanking DNA of 
the M-CAT-1 clement support the notion that LMC IB rec- 
ognizes a site overlapping but distinct from the E box, in 
agreement with the mcthylation interference data presented 
here (31a). The LMC IB complex may mediate repression of 
the M-CAT- 1 element in nonmuscle cells because a mutation 
that disrupts its formation correlates with increased M-CAT- 
dependcnt promoter activity in nonmuscle cells (Fig. 2D). It 
has been noted that LMC IB derived from skeletal muscle 
tissues appears to migrate fractionally faster than does non- 
muscle-derivcd LMC IB (Fig. 5). We hypothesize that this 
slight difference in electrophoretic mobility is indicative of a 
posttranslational modification, such as phosphorylation, of a 
component of the complex that permits a muscle-spccific re- 
lease of the repression. 

The M-CAT-2 low-mobility complex, LMC 2B, has charac- 
teristics similar to those of LMC IB. Besides having similar 
electrophoretic mobility and broad methylation interference 
footprints, both contain TEF-1 proteins (see below) and nei- 
ther is competed for by M-CAT elements containing non- 
cTNT-flanking regions. Like LMC IB, LMC 2B is likely to be 
involved in mediating the cell specificity of the M-CAT-2 ele- 
ment, because LMC 2B is generated by sequence-specific in- 
teraction of a nuclear protein(s) with the M-CAT-2 flanking 
region and this flanking region is required for the tissue spec- 
ificity of M-CAT-2. Nevertheless, the flanking sequences rec- 
ognized in each M-CAT clement are quite different, and M- 
CAT-l and M-CAT-2 do not cross-compete for low-mobility 
complex formation. Thus, either LMC IB and LMC 2B con- 
tain different additional proteins or, if the same protein(s) is 
involved in the formation of both complexes, it must interact 
with the flanking sequences in a substantially different manner. 

Methylation interference footprinting of LMC IB or LMC 
2B gave a broad footprint overlapping both the M-CAT site 
and the immediate 5'-flanking sequence of both M-CAT-1 and 
M-CAT-2 elements (Fig. 7). Moreover, each of the three im- 
munologically related forms of TEF-1 protein found in muscle 
nuclear extracts was also found to be present in LMC IB and 
LMC 2B (Fig, 8). However, the interaction of TEF-1 proteins 
with the cognate M-CAT-binding motif was subtly altered in 
the low-mobility complexes in that the mcthylation footprints 
were shifted toward the 5' region of the element, compared 
with the TEF-I footprint of the high-mobility complexes, CI, 
C2, and C3 (compare Fig. 6C and 7C). We conclude that LMC 
IB and LMC 2B consist of higher-order complexes of TEF-1 
proteins and other auxiliary proteins. Low-mobility complexes 
may consist of multiple species with distinct TEF-1 protein 
constituents (as do the high-mobility bands, CI, C2, and C3 
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FIG. 9. Diagrammatic representation outlining the mechanism proposed for 
the tissue specificity of M-CAT elements containing cTNT-dcrived flanks. (A) 
Simple M-CAT elements bind TEF-l proteins but have flanking sequences thai 
do not bind additional factors. Such elements direct non-ccll-spccific expression. 
(B and C) Complex M-CAT elements bind TEF-l proteins but have flanking 
sequence that binds an additional protcin(s) (X/X'). The complex of TEF-1 
protein(s) and X in nonmuscle cells is inactive (B). whereas the complex of 
TEF-1 protein(s) and X' in nonmuscle cells is active (C). The difference between 
the complexes in panels B and C may lie in modifications (»f TEF-l proicin(s) or 
the X component (shown here as X versus X') or diflferences in the primary 
components or stoichiometric balance within them. 



(14a]), but size differences in this range may not be resolved 
under these gel shift conditions. The auxih'ary proteins may 
differ between LMC IB and LMC 2B, but both alter the bind- 
ing of TEF-1 proteins to their cognate motif in a similar man- 
ner. The displacement of the TEF-1 footprint in these higher- 
order complexes may be a measure of the tightness of the 
interaction between TEF-1 proteins and the auxiliary pro- 
tein(s), such that TEF-I proteins are tilted off their cognate 
site toward the 5' binding site of the second protein (Fig. 7C). 
Alternatively, the altered footprint may represent changes in 
the combination of TEF-1 proteins bound on these elements. 

In summary, muscle-specific M-CAT elements are a distinct 
subset of active M-CAT elements. Simple M-CAT elements, 
such as those found in the SV40 promoter, direct non-cell- 
specific gene expression. These consist of M-CAT core motifs 
that bind TEF-1 proteins and flanking sequences that do not 
bind protein (Fig. 9A). Muscle-specific M-CAT elements, on 
the other hand, consist of a core M-CAT motif that binds 
TEF-1 proteins plus flanking DNA that binds an additional 
protein(s) in a sequence-specific fashion. We hypothesize that 
the second binding determines whether the element is nega- 
tively regulated (in nonmuscle cells [Fig. 9B]) or positively 
regulated (in muscle cells [Fig. 9C|). Neither the identity nor 
the diversity of potential secondary binding proteins is known 
at this time (see below). 

The model of M-CAT element-dependent muscle-specific 
transcription outlined above differs markedly from the mech- 
anism by which E boxes arc believed to activate muscle-specific 
transcription. E-box-binding factors (MDFs) arc themselves 
muscle specific and, once expressed, are potentially capable of 
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MUSCLE-SPECIFIC M-CAT ELEMENTS 
TABLE 2. M-CAT elements and their flanking DNA in other musclc-spccific promoters 
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Promoter and position 



M-CAT and flanking sequence" 



Tissue" 



Rcfcrcnce(s) 



cTNT 
Chicken (-66S) 
Chicken (-89S) 

pMHC 
Human (-284AS) 
Rabbit (-264AS) 
Rat (-267AS) 
Rat (-1%S) 

aMHC 

Rat (-236 AS) 

Rat (-42AS) 
Myosin light chain 2 

Chicken (-35AS) 
s/cTNC 

Mouse (-50 AS) 
VSM a-actin 

Mouse (-175 AS) 
Skin a-actin 

Chicken (-6()S) 

Mouse (-63S) 
AChRp 

Rat (-43S) 
PGAM-M 

Human (— 72AS) 
c-mos 

Rat (-1561S) 

Rat (-1541S) 
AE3 Cr/HCO,- channel 

Rodent/human (-190S) 
Brain natriuretic peptide 

Rat (-101 AS) 

Human (-117S) 



GCGCCGGGCACATTCCTGCTGCTCTGC 
ACAAGIGTTGCATTCCTCTCTGGGCGC 



CCAGGCCTCACATTCCACAGCIGGCAG 
TATCTCCTCGCATTCCACTGCCTGTGG 
ATATCCCTTACATTCCACAGCTCAC 
CTGAACATGCCATACCACAACAATGAC 



TCCTTGGGCACArrCCTCCTCCCCAAA 
CTTTATAGCTCATTCCACGIGCCTGCT 

AAATACACCCCATTCCAGGCTAAAAAT 

CCTCCTGCTACATTCCCAGCCCAGCCC 

TCrrCCACTGCATTCCTCTGTTCTGCT 

AGCTTGCCGCATTCCTGGGGGCCGGG 
AGGGCAGCAACATTCTTTCGGGGCGGT 

ACAGGTGCACATTCCTGGGCGCCTCG 

TGCCAATCAGCATTCCAGGCGGTGGCA 

AGGCTTTATCCATTTCTGAGATAAAGA 
ATAAAGATTTCATTTCTAATCTCAGTA 

GATCTCGGCACATTCCrrCCATCTTAT 

TTATCAGACACATTCCTGCCTGCTGAG 
CGGAGGGGCTCATTCCCGGGCCCTGAT 



Sk, C 
Sk, C 



C 
Sk 

Sk, C 

C, T PKC 



Sk, C 

C, t cAMP 

c 

VSM, t scrum 

C, t TGF-p 
t PKC 

Sk 

Sk,C 

Sk 
Sk 



C 



36 
37 



15 
48 
55 
28 



40 
20 

46 

44 

7, 52 

35 
29 

4 

41 

32 
32 

33 

18 
31 



" M-CAT motifs are shown in boldface text. Ten nucleotides of sequence flanking the M-CAT on each side is shown in every case for which this information is 
published. Nominal E-box sequences arc underlined. ....... , i 

" Sk C or VSM indicates the tissue type in which the M-CAT-containing promoter was tested: skeletal myocytes or derived cell Imes, cardiomyocytcs, and vascular 
smooth muscle cells, respectively. Other notations refer to M-CATs which have been demonstrated to be the target response elements for the stimuli noted: cyclic AMP 
(cAMP), protein kinase C (PKC), or transforming growth factor beta (TGF-P). 

** M-CAT element whose function has not yet been shown. 



activating all appropriate E-box targets de novo. M-CAT ele- 
ments, by contrast, bind TEF-1 proteins that are widely ex- 
pressed in many cell types but whose activity is modified by the 
interaction of a cofactor(s) that binds to specific flanking re- 
gions. This added level of complexity may provide greater 
flexibility in the regulation of M-CAT-dependent promoters, 
by virtue of the conditional and modulatable activity of these 
compound elements. 

The cTNT-derived M-CAT elements studied here were 
compared with other M-CAT elements implicated in the func- 
tion of muscle-specific promoters (Table 2). None of these 
M-CAT elements are precisely conserved with respect to the 
immediate flanking sequence; however, it is noteworthy that 
nominal E boxes appear close to M-CAT elements in the 
cTNT, p myosin heavy chain (pMHC), aMHC, and acetylcho- 
line receptor p (AChRp) subunit promoters (4, 15, 20, 38). 
Mutation of the E box flanking the distal M-CAT in the human 
PMHC promoter reduces cardiac activity to one-fifth (15). The 
individual requirement for the E box which flanks the M-CAT 
in the AChRp promoter has not been reported. 

The M-CAT element and E box in the aMHC proximal 
promoter are particularly interesting. In this promoter, the 
M-CAT element overlaps the E box, and this complex element 
mediates cyclic AMP inducibility and is required for basal 



cardiac muscle-specific promoter activity (20). On gel retarda- 
tion analysis, this element generates only a low-mobility com- 
plex, which is competed for by M-CAT elements and contains 
a TEF-1 protein(s) (20). The methylation interference foot- 
print of that complex is similar to the footprint derived here for 
LMC IB and LMC 2B (20), Thus, the auxiliary protein in LMC 
IB may be related or identical to that binding the E-box/M- 
CAT element of the aMHC promoter. 

Another E box, found in the AChRS promoter, is capable of 
mediating both activation in myotubes and repression in myo- 
blasts and fibroblasts (50). Unlike the M-CAT-1 E box, the 
AChRS E box binds myoD and myogcnin as well as a third, 
non-muscle-specific, unidentifiable factor. Furthermore, the 
AChR8 E-box is not found in association with another ex- 
acting sequence upon whose activity it depends (50); in con- 
trast, mutation of the core M-CAT motif in M-CAT-1, which 
leaves the E box intact, abolishes all promoter activity of the 
mutant element in muscle (data not shown). This difference 
may reflect the differential ability of these E boxes to bind 
MDFs as well as the presumptive repressor protein. Thus, the 
third non-MDF factor bound to the AChR8 E box may be 
related to the M-CAT-1 auxiliary factor found in LMC IB. 

The nominal E-box-binding site may constitute one con- 
served binding site for TEF-1 -associated cell-specific repres- 
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sors The M-CAT elements found in other cardiac promoters 
which do not attain E boxes are nevertheless probably subject 
to similar repression mechanisms. The lack of overt conserva- 
tion in these other M-CAT clement-flanking sequences and 
indeed between those of M-CAT-l and M-CAT-2 may point to 
the existence of a number of such auxiliary protems, perhaps 
each facilitating the targeting of different second-messenger 
cascades to TEF-l proteins. On the other hand, the diversity 
may reflect the ability of a single factor to bind a very loosely 
defined binding site once closely associated with a TEF-1- 
binding motif. 
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